The Gaussian Process Density Sampler
نویسنده
چکیده
The Gaussian process is a useful prior on functions for Bayesian regression and classification. Density estimation with a Gaussian process prior has been difficult, however, due to the requirements that densities be nonnegative and integrate to unity. The statistics community has explored the use of a logistic Gaussian process for density estimation, relying on various methods of approximating the normalization constant (e.g. [1, 4]). We propose the Gaussian Process Density Sampler (GPDS), a nonparametric, practical and consistent method of constructing a Markov chain on the properties of a posterior distribution on an unknown density, without approximation. The GPDS is composed of four parts. The first part is a GP-based prior on density functions. We develop an exchangeable procedure for generating exact samples in data space from a common density drawn from this prior. Second, we show that this prior allows practical inference of specific values of the unnormalized density, using the recently-developed technique of exchange sampling [3]. Third, we extend this MCMC algorithm to draw samples from the predictive distribution on data space that arises when the posterior on density functions is integrated out. This is our primary result. Finally, we demonstrate a sampling procedure for inference of the Gaussian process hyperparameters.
منابع مشابه
The Gaussian Process Density Sampler
We present the Gaussian Process Density Sampler (GPDS), an exchangeable generative model for use in nonparametric Bayesian density estimation. Samples drawn from the GPDS are consistent with exact, independent samples from a fixed density function that is a transformation of a function drawn from a Gaussian process prior. Our formulation allows us to infer an unknown density from data using Mar...
متن کاملKernel Methods for Nonparametric Bayesian Inference of Probability Densities and Point Processes
Nonparametric kernel methods for estimation of probability densities and point process intensities have long been of interest to researchers in statistics and machine learning. Frequentist kernel methods are widely used, but provide only a point estimate of the unknown density. Additionally, in frequentist kernel density methods, it can be difficult to select appropriate kernel parameters. The ...
متن کاملSampling Gaussian Distributions in Krylov Spaces with Conjugate Gradients
This paper introduces a conjugate gradient sampler that is a simple extension of the method of conjugate gradients (CG) for solving linear systems. The CG sampler iteratively generates samples from a Gaussian probability density, using either a symmetric positive definite covariance or precision matrix, whichever is more convenient to model. Similar to how the Lanczos method solves an eigenvalu...
متن کاملThe Infinite Mixture of Infinite Gaussian Mixtures
Dirichlet process mixture of Gaussians (DPMG) has been used in the literature for clustering and density estimation problems. However, many real-world data exhibit cluster distributions that cannot be captured by a single Gaussian. Modeling such data sets by DPMG creates several extraneous clusters even when clusters are relatively well-defined. Herein, we present the infinite mixture of infini...
متن کاملThe Infinite Mixture of Infinite Gaussian Mixtures
Dirichlet process mixture of Gaussians (DPMG) has been used in the literature for clustering and density estimation problems. However, many real-world data exhibit cluster distributions that cannot be captured by a single Gaussian. Modeling such data sets by DPMG creates several extraneous clusters even when clusters are relatively well-defined. Herein, we present the infinite mixture of infini...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008